Using Spectrogram Reading Knowledge and Neural Networks

نویسندگان

  • Takaharu TANAKA
  • Takeshi KA
چکیده

We present a method for phoneme recognition using an expert system combining spectrogram reading knowledge and neural networks, and we report its performance. The proposed expert system consists of two parts : (1) phoneme segmentation based on spectrogram reading knowledge used by human experts, and (2) phoneme identification using neural networks applied to the phoneme boundaries determined in phoneme segmentation. Highly accurate phoneme segmentation can be achieved by using humanlike contextual spectrogram reading knowledge. Moreover, high performance phoneme identification can be achieved by applying neural networks to the accurate phoneme segmentation result. The system was tested on Japanese consonants, with 90.8% ofthe phonemes correctly segmented and 92.4% of the phonemes correctly identified within the correct segment. 83.9% of the phonemes were correctly recognized bothin segmentation and identification.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Formalizing knowledge used in spectrogram reading: acoustic and perceptual evidence from stops

Since the invention of the sound spectrograph in 1946 by Koenig, Dunn and Lacey, spectrograms have been widely used for speech research. Over the last decade there has been revived interest in the application of spectrogram reading toward continuous speech recognition. Spectrogram reading involves interpreting the acoustic patterns in the image to determine the spoken utterance. One must select...

متن کامل

Polyphonic music transcription through dynamic networks and spectral pattern identification∗

The automatic extraction of the notes that were played in a digital musical signal (automatic music transcription) is an open problem. A number of techniques have been applied to solve it without concluding results. This work tries to pose it through the identification of the spectral pattern of a given instrument in the signal spectrogram using time-delay neural networks. We will work in the m...

متن کامل

The Effect of Mobile Social Networking on the Reading Habit of the Student Teachers of Frahangian University in South Khorasan Province

Purpose: Since today the use of mobile phones is extremely widespread, and individuals with different age groups and social classes devote a considerable amount of time to using these social networks, training the use of these networks can provide a platform for improving reading habits among young people. Therefore, the purpose of this study was to investigate the effect of Telegram messenger ...

متن کامل

Automatic Tagging Using Deep Convolutional Neural Networks

We present a content-based automatic music tagging algorithm using fully convolutional neural networks (FCNs). We evaluate different architectures consisting of 2D convolutional layers and subsampling layers only. In the experiments, we measure the AUC-ROC scores of the architectures with different complexities and input types using the MagnaTagATune dataset, where a 4-layer architecture shows ...

متن کامل

Flood Forecasting Using Artificial Neural Networks: an Application of Multi-Model Data Fusion technique

Floods are among the natural disasters that cause human hardship and economic loss. Establishing a viable flood forecasting and warning system for communities at risk can mitigate these adverse effects. However, establishing an accurate flood forecasting system is still challenging due to the lack of knowledge about the effective variables in forecasting. The present study has indicated that th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006